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Abstract 



O 

D ' In this paper we propose a new four-parameters distribution with increasing, decreasing, bathtub- 

n ■ 

shaped and unimodal failure rate, called as the exponentiated Weibull-Poisson (EWP) distribu- 

CN I tion. The new distribution arises on a latent complementary risk problem base and is obtained by 

compounding exponentiated Weibull (EW) and Poisson distributions. This distribution contains 

several lifetime sub-models such as: generalized exponential-Poisson (GEP), complementary 

Weibull-Poisson (CWP), complementary exponential-Poisson (CEP), exponentiated Rayleigh- 

^, 1 Poisson (ERP) and Rayleigh- Poisson (RP) distributions. 

We obtain several properties of the new distribution such as its probability density func- 
> ■ 

\^ . tion, its reliability and failure rate functions, quantiles and moments. The maximum likelihood 

00 ' 

ly-^ ' estimation procedure via a EM-algorithm is presented in this paper. Sub-models of the EWP 



distribution are studied in details. In the end. Applications to two real data sets are given to 
show the flexibility and potentiality of the new distribution. 
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estimation, Probability weighted moments. Residual life function. 
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1. Introduction 

The Weibull and EW distributions provide a simple and close form solution to many prob- 
lems in lifetime and reliability. However, they do not provide a reasonable parametric fit for 
some practical applications, for example the Weibull distribution is not useful for modeling 
phenomenon with non-monotone failure rates. The bathtub-shaped and the unimodal failure 
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rates which are common in rehabihty and biological studies cannot be modeled by the Weibull 
distribution. 

Recently, attempts have been made to define new families of probability distributions that 
extend well-known families of distributions and at the same time provide great flexibility in mod- 
eling data in practice. The exponential-geometric (EG), exponential-Poisson (EP), exponential- 
logarithmic (EL), exponential-power series (EPS), Weibull-geometric (WG) and Weibull-power 
series (WPS) distributions were introduced and studied by Adamidis and Lojrkas Kus 23], 



Tahmasbi and Rezaei Issl . Chahkandi and Ganjali [3], Barreto-Souza et al. 0] 



and Morals and 



Barreto-Souza et al. 



3l|], respectively. ^ 

Barreto-Souza and Cribari-Neto [5] and Louzada et al. [24] introduced the exponentiated 
exponential-Poisson (EEP) and the complementary exponential-geometric (CEG) distributions 
where the EEP is the generalization of the EP distribution and the CEG is complementary to 
the EG model proposed by Adamidis and Loukas Q]. Recently, Cancho et al. [9] introduced 
the two-parameter Poisson-exponential (PE) lifetime distribution with increasing failure rate. 



Mahmoudi and Jafari 



251 ] introduced the generalized exponential-power series (GEPS) distri- 



bution by compounding the generalized exponential (GE) distribution with the power series 
distribution. Also exponentiated Weibull-logarithmic (EWL), exponentiated Weibull-geometric 



(EWG) and exponentiated Weibull-power series (EWP) distributions has 



jeen introduced and 



analyzed by Mahmoudi and Sepahdar 



261 ] and Mahmoudi and Shiran 



27 



28|. 



In this paper, we propose a new four-parameters distribution, referred to as the EWP distri- 
bution, which contains as special sub-models the GEP, CWP, CEP, ERP and RP distributions, 
among others. The main reasons for introducing the EWP distribution are: (i) This distribution 
due to its flexibility in accommodating different forms of the risk function is an important model 
that can be used in a variety of problems in modeling lifetime data, (ii) This distribution is 
a suitable model in a complementary risk problem base (Basu and Klein, [^) in presence of 
latent risks, in the sense that there is no information about which factor was responsible for 
the component failure and only the maximum lifetime value among all risks is observed, (iii) 
It provides a reasonable parametric fit to skewed data that cannot be properly fitted by other 
distributions and is a suitable model in several areas such as public health, actuarial science, 
biomedical studies, demography and industrial reliability. 

The paper is organized as follows. In Section 2, we review the EW distribution and its 



2 



properties. In Section 3, we define the EWP distribution. The density, survival and hazard 
rate functions and some of their properties are given in this section. We derive quantiles and 
moments of the EWP distribution in Section 4. Renyi and Shannon entropies of the EWP 
distribution are given in Section 5. Section 6 provides the moments of order statistics of the 
EWP distribution. Residual and reverse residual life functions of the EWP distribution are 
discussed in Section 7. In Section 8 we explain probability weighted moments. Mean deviations 
from the mean and median are obtained in Section 9. Section 10 is devoted to the Bonferroni and 
Lorenz curves of the EWP distribution. Estimation of the parameters by maximum likelihood 
via a EM-algorithm and inference for large sample are presented in Section 11. In Section 12, 
we studied some special sub-models of the EWP distribution. Applications to real data sets are 
given in Section 13 and conclusions are provided in Section 14. 



2. EW distribution: A brief review^ 



32(1 as extension of the 



The EW distribution, introduced by Mudholkar and Srivastava 
Weibull family, contains distributions with bathtub-shaped and unimodal failure rates besides 
a broader class of monotone failure rates. The applications of the EW distribution in reliab ility 



and survival studies were illustrated by Mudholkar et al. )33[| . Mudholkar and Huston 34] 



Gupta and Kundu [3], Nassar and Eissa 



3a] and Choudhury 
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The random variable X has an EW distribution if its cumulative distribution function (cdf) 
takes the form 



Fx{x) 



1 



X > 0, 



where a > 0, /3 > and 7 > 0. The corresponding probability density function (pdf) is 



a-l 



(1) 



(2) 



The survival and hazard rate functions of the EW distribution are 



and 



Six) 



h{x) = a7/3Ta;^-ie-(^^)'' ( 1 - e 



-(/3x)- 



1 



1 



-{I3xy 



-1 



respectively. The kth. moment about zero of the EW distribution is given by 



Note that for positive integer values of a, the index j in previous sum stops at a — 1 and the 
above expression takes the closed form 

when ^ 

^fc(7) = 1 + (" ~j ^) + A; = 1, 2, 3, • • • , 



where r(^ + 1) denotes the gamma function (see, Nassar and Eissa 36] for more details). 

3. The EWP distribution 

Suppose that the random variable X has the EW distribution where its cdf and pdf are 
given in (1) and (2). Given N, let Xi,--- ,X]\f be independent and identify distributed ran- 
dom variables from EW distribution. Let is distributed according to zero truncated Poisson 
distribution with pdf 

P{N = n) = — — n = l,2,-- - , 0>O. 

n!(l — e "^j 

Let Y = max(Xi, • • • , Xi\[), then the cdf of Y\N = n is given by 

which is EW distribution with parameters na, /3 and 7. The EWP distribution, denote by 
EWP(a, /3, 7, 9), is defined by the marginal cdf of Y, i.e. 

ny) = ^^31 • (3) 

This new distribution includes some sub-models such as CEP, GEP, GWP, ERP and RP distri- 
butions. 

The pdf of EWP distribution is given by 
where a, I3,j,0 > 0. 

The survival and hazard rate functions of EWP distribution are given, respectively, by 

e(l_e-(/3!/)^)" _ 

S{y) = 1 - ^ ^- , (5) 

e" — 1 
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Figure 1: Plots of density function of EWP distribution for P — 1 and different values a, 9 and 7. 



and 



g6» _ g9(l-e-(/3!')^)" 

The plots of density and hazard rate functions of EWP distribution for ^ 
values (a, 7,^) are given in Figs. 1 and 2, respectively. 

Proposition 1. The limiting distribution of EWP (a, /3,7, 0) where — )• 0+ is 

lim F(y) = (l-e-^'^y^^T, 
which is the cdf of EW distribution. 



(6) 

1 and different 



Proposition 2. The limiting behavior of hazard function of EWP distribution in 

00 < a < 1 

and liniy^^oo h{y) = 0. 

a > 1, 
00 < Q < 1 

a = 1 and limy_^oo Kv) = 
a > 1, 

(iii) for 7 > 1, liiHy^Q h{y) = 0, for each value a > and lim^^oo ^{v) = oo. 



IS 



(i) for < 7 < 1, lim^^o Hv) 



(ii) for 7 = 1, liuiy^o h{y) = < 
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Figure 2: Plots of hazard function of EWP distribution for f} — 1 and different values a, 9 and 7. 
Proof. The proof is a forward calculation and is omitted. 

4. Quantiles and moments of the EWP distribution 

Some of the most important features and characteristics of a distribution can be studied 
through its moments and quantiles such as tending, dispersion, skewness and kurtosis. Also, the 
quantiles of a distribution can be used in data generation from a distribution. 

The pth quantile of the EWP distribution is given by 



which is used for data generation from the EWP distribution. 

Now we obtain the moment generating function of the EWP distribution. Suppose that Y ~ 
EWP{a, f3,'j,9) and = max{Xi,--- where Xi ~ EW{a, l3,'y) for i = 1, 2, • • • , n. 
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then 



Mvit) =En=i m = n)Mx,Jt) 



= (^Er=iEr=oEr=o(-iy (f^^rT')(^' + i)"^''''^r(i + f). (s) 

One can use My(t) to obtain the kth moment about zero of the EWP distribution. We have 

Using the change of variable i = n — 1, another equivalent formula for E{Y^) is given by 

The mean and variance of the EWP distribution are given, respectively, by 

and 

Var{Y) = ^Si|Er=iEr=o(-lP(S^(T')(^' + l)"^'^'^ -^'(^)' (1^) 
where E{Y) is given in Eq. ()lip . Note that for positive integer values of a, the index j in Eqs. 
©-(HID stops aXna-l. 



5. Renyi and Shannon entropies 

If X is a random variable having an absolutely continuous cumulative distribution function 
F[x) and probability distribution function /(x), then the basic uncertainty measure for distri- 
bution F (called the entropy of F) is defined as H{x) = log(/(X))]. Statistical entropy is 
a probabilistic measure of uncertainty or ignorance about the outcome of a random experiment, 
and is a measure of a reduction in that uncertainty. Numerous entropy and information indices, 
among them the Renyi entropy, have been developed and used in various disciplines and con- 
texts. Information theoretic principles and methods have become integral parts of probability 
and statistics and have been applied in various branches of statistics and related fields. 



Entropy has been used in various situations in science and engineering. The entropy of a 
random variable y is a measure of variation of the uncertainty. For a random variable with the 
pdf /, the Renyi entropy is defined by lR{r) = IosI/k f^{y)(^y}^ for r > and r 7^ 1. Using 
the power series expansion (1 — z)"- = X]j^o(~^)"' (j)-^"' ^'^^ change of variable {l3yY = u gives 

Using formula /o^y"(^-i)e-('-+^)(/^2^)"dy = r( '-(^-^)+i )(^/?'-(7-i)+i))-i(^ + fc)~^''^'^\ we have 

Thus, according to the definition of Renyi entropy we have 

lR{r) = -^log 
1 — r 

The Shannon entropy is defined by E[— log[/(y)]] . This is a special case derived from lim^^i lR{r). 

6. Moments of order statistics 

Order statistics make their appearance in many areas of statistical theory and practice. 
Moments of order statistics play an important role in quality control testing and reliability, 
where a practitioner needs to predict the failure of future items based on the times of a few 
early failures. These predictors are often based on moments of order statistics. We now derive 
an explicit expression for the density function and cumulative distribution function of the rth 
order statistic 1^^, in a random sample of size n from the EWP distribution. 

Let the random variable Yr;n be the rth order statistic (Yi-n < < • • • < ^:n) in a 
sample of size n from a distribution with pdf /(.) and cdf F{.), respectively. The pdf of Yr;n for 
r = 1, • • • , n is given by 

^-^^^ = Bir,n-r + l) ^^y^ t^^^^J'"' " ^^^^J"""' ^ > ^^^^ 
Substituting from ([3]) and ([!]) into (fT3]) and using the binomial expansion gives 

Jr:n\y) — B{r,n-r+l) {eH-l)"^ ^ \^ J 

X jy.z^ Ej-o(-l)''"*~^~n''7^) ("T'')e("-''-J')^e(*+^')''(^~^~"''''^)". (14) 
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(e« 



-IV 



1) + 1. 



EE 

j=0 k=0 



(-l)'(^ 



a{r + j) 
k 



+ k) 



r(7-l) + l 



Also the cdf of Yr-n is given by 



= EL. Eto a) rT'=)e("-'=-^-)^e(^+^-)^(i— ^^^'^)". (15) 

Using the binomial expansion, after some calculations the kth. moment of the rth order statistic 
Yr:n is given by 

ad -p { k j_ ^n-r (-1)' ^ _-^^r+i-j-l (r+j~l\ 



X EEo ^ E^o(-i)*r^. + 1)"^'^^^- (16) 

The pdf and cdf of the smallest and biggest order statistics Yi-n and Yn-.n can be obtained using 
Eqs. (fH|) and ([T5|) for special cases r = 1 and n. 
For the smallest order statistics Yimj we have 

(e 1)"" V / 

and for the biggest order statistics Yn-n^ we have 

The expectation of the smallest and biggest order statistics Yi-n and Yn-n can be obtained by 
setting r = 1 , n and A; = 1 in (jl6p . 



7. Residual life function of the EWP distribution 

Given that a component survives up to time t > 0, the residual life is the period beyond t 
until the time of failure and defined by the conditional random variable X — t\X > t. The mean 
residual life (MRL) function is an important function in survival analysis, actuarial science, 
economics and other social sciences and reliability for characterizing lifetime. Although the 
shape of the failure rate function plays an important role in repair and replacement strategies, 
the MRL function is more relevant as the latter summarizes the entire residual life function, 
whereas the former considers only the risk of instantaneous failure. In reliability, it is well 



known that the MRL function and ratio of two consecutive moments of residual hfe determine 
the distribution uniquely (Gupta and Gupta, [isl). 

MRL function as well as failure rate function is very important, since each of them can be 
used to determine a unique corresponding lifetime distribution. Lifetimes can exhibit IMRL 
(increasing MRL) or DMRL (decreasing MRL). MRL functions that first decreases (increases) 
and then increases (decreases) are usually called bathtub-shaped (upside-down bathtub), BMRL 
(UMRL). The relationship between the behaviors of the two functions of a distribution was 



studied by many authors such as Ghitany 



lH, Mi 0, Park and Tang et al. [3S|. 



The rth order moment of the residual life of the EWP distribution is given by the general formula 
mr{t) = E[{Y- ty\Y >t\ = ^^j^{y- tYf{y)dy, 



where S{t) = 1 — F{t), is the survival function. 
In what seen this onwards, we use the expressions 



oo 

X 



and 



where r''(t) = x^~^e~^dx is the upper incomplete gamma function and Ts{t) = x^~^e~^dx 
is the lower incomplete gamma function. 

The rth order moment of the residual life of the EWP distribution is given by 



X 



(«o-+i)-i)r(i+^)((fc + i)(^i)7), (17) 



where S{t) (the survival function of Y) is given in ([5]). 

For the EWP distribution the MRL function which is obtained by setting r = 1 in (jl7p . is given 
in the following theorem. 

Theorem 1. The MRL function of the EWP distribution with cdf ^ is given by 
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The second moment of the residual hfe function of the EWP distribution is 



m2{t) 



- m Ef=o Er=o(-i)' — ^ r(^"','^-')r('+^) {{k + imV)] } + 1\ 

7!(fc + l) T -I J 



The variance of the residual life function of the EWP distribution can be obtained using mi{t) 
and m2{t). 

On the other hand, we analogously discuss the reversed residual life and some of its properties. 
The reversed residual life can be defined as the conditional random variable t — X\X < t which 
denotes the time elapsed from the failure of a component given that its life is less than or equal 
to t. This random variable may also be called the inactivity time (or time since failure); for 



more details one may see Kundu and Nanda 



22l | and Nanda et al. f35|. Also, in reliability, the 



mean reversed residual life (MRRL) and ratio of two consecutive moments of reversed residual 
life characterize the distribution uniquely. Using @ and ([5]), the reversed failure (or reversed 
hazard) rate function of the EWP is given by 

~ F{y) ~ e.^d-e-iM-'r - 1 

The rth moment of the reversed residual life function can be obtained by the formula 

1 



^lr{t) = E[{t-YY\Y < t\ 



{t - yyf{y)dy. 



Hence, 



ad 



(a{j+l)-l\ 



(18) 



The mean and second moment of the reversed residual life of the EWP distribution can be 
obtained by setting r = 1 and 2 in (fTSl) . Also, using fj,i{t) and /X2(t) one can obtain the variance 
of the reversed residual life function of the EWP distribution. 



8. Probability weighted moments 

Probability weighted moments (PWMs) are expectations of certain functions of a random 
variable defined when the ordinary moments of the random variable exist. The PWMs method 
can generally be used for estimating parameters of a distribution whose inverse form cannot be 
expressed explicitly. 

11 



Estimates based on PWMs are often considered to be superior to standard moment-based esti- 
mates. They are sometimes used when maximum hkelihood estimates are unavailable or difficult 
to compute. They may also be used as starting values for maximum likelihood estimates. 

The PWMs method, which has been investigated by many researchers, was originally pro- 
posed by Greenwood et a _ 



purposes. Hosking et al. 



13]. Since then it has been used widely in practice and for research 
20y investigated the properties of parameters estimated by the PWMs 



method for the generalized extreme value (GEV) distribution using fairly long observed series 



and they gave a good summary of the PWMs method. Hosking [2l|] showed that the PWMs 
method is superior to the Maximum Likelihood (ML) method in parameter estimations when 
the extreme value distribution is used for longer return periods. 

In this paper we calculate the PWMs of the EWP distribution since they can be used for 
estimating the EWP parameters. For a random variable with the pdf /(.) and cdf F{.), the 
PWMs function are defined by 

Ts,r = E [X'F{Xr] = / x%Fix)yf{x)dx. 
Jo 

For the EWP distribution the PWMs are given by 



Using formula r„°° x'*'^* = . , , we have 



Remark 1. The sth moment of the EWP distribution can be obtained by putting r = in 
(jl9p . Therefore 

ae ~ + - + r(i + ^) 



s 5 



which is equal to Eq. (jlOp . if s is replaced by k. Also, the mean and variance of the EWP 
distribution can be obtained by setting s = 1 and 2 in above equation. 
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9. Mean deviations 

The amount of scatter in a population can be measured by tlie totality of deviations from the 
mean and median. The mean deviation from the mean is a robust statistic, being more resilient 
to outliers in a data set than the standard deviation. In the standard deviation, the distances 
from the mean are squared, so on average, large deviations are weighted more heavily, and thus 
outliers can heavily influence it. In the mean deviation from the mean, the magnitude of the 
distances of a small number of outliers is irrelevant. The mean deviation from the median is a 
measure of statistical dispersion. It is a more robust estimator of scale than the sample variance 
or standard deviation. It thus behaves better with distributions without a mean or variance, 
such as the Cauchy distribution. 

For a random variable X with pdf /(x), cdf -F(a;), mean /i and median M, the mean deviation 
from the mean and the mean deviation from the median are defined by 



5i{x)= / \x- ^i\f{x)dx = 2^lF{^x)-2I{^x), 

Jo 



and 



poo 

/ \x- M\f{x)dx = 2MF{M) - M + ti-2I{M), 
Jo 

respectively, where I{b) = xf{x)dx 

Theorem 2. The Mean deviations function of the EWP distribution are 

^ ^ j\{k-\-l) 'y 



xr(i+i)((A; + l)(/3/ir) 

and 



where fj, is given in 177]) and M can be obtained by setting p = .5 in 



10. Bonferroni and Lorenz curves 

Study of income inequality has gained a lot of importance over the last many years. Lorenz 
curve and the associated Gini index are undoubtedly the most popular indices of income inequal- 
ity. However, there are certain measures which despite possessing interesting characteristics have 
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not been used often for measuring inequality. Bonferroni curve and scaled total time on test 
transform are two such measures, which have the advantage of being represented graphically in 
the unit square and can also be related to the Lorenz curve and Gini ratio (Giorgi, |l5l|). These 
two measures have some applications in reliability and life testing as well (Giorgi and Crescenzi, 
[l^). The Bonferroni and Lorenz curves and Gini index have many applications not only in 
economics to study income and poverty, but also in other fields like reliability, medicine and 
insurance. 

For a random variable X with cdf F{.), the Bonferroni curve is given by 



Bf[F{x)] = —3— / uf{u)du. 
A'-f" l^J Jo 



From the relationship between the Bonferroni curve and the mean residual lifetime, the Bonfer- 
roni curve of the EWP distribution is given by 



xr(,+i)((A; + l)(/3xr) 
The Lorenz curve of the EWP distribution can be obtained via the expression 

L,[F{x)] = B,[F{x)]F{x) = ^^Er=oEr=o(-l)'^— ^r'".^^"') 
xr(i+i)((A; + l)(/3xr)], 
where \x is the mean of the EWP distribution. 

The scaled total time on test transform of a distribution function F is defined by 



1 



SF\F{t)\ = - / S{u)du. 
fJ- Jo 

If F{t) denotes the cdf of the EWP distribution then 

The cumulative total time can be obtained by using formula Cp = SF[F{t)]f{t)dt and the 
Gini index can be derived from the relationship G = 1 — Cp. 

11. Estimation and inference 

In this section, we discuss the estimation of the parameters of the EWP distribution. Let 
Yi,Y2, ■ ■ ■ ,Yn he a random sample with observed values yi, 1/2, • ' ' > 2/n from EWP distribution 
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with parameters a,(3,j and 6. Let O = {a, I3,'j,0)^ be the parameter vector. The total log- 
Ukehhood function is given by 

In = ln{y; @) = -n log[e^ - 1] + n[log 6* + log a + log 7 + 7 log P] 

+ (7 - 1) EILi log y^ - U=liPy^r + («-!) EILi iog(i - e-^^^'^' ) 

The associated score function is given by Un{@) = {dln/da,dln/dp,dln/d'j,dln/d9)'^ , where 



dlr,. 

da 

din 
9/3 



din 
97 



din 

de 



= 1 + E?=i log(l - e-(^^^)^) + e Er=i log(l - e-(^^^)^)(l - e-(^^^)^)«, 



= f - 7/?^-^ Er=i yZ + (« - 1)7^^-^ E?=i H 

+ 07a;3^-i ELi yje-^^y^'^^i - e-^^y^)'Y-\ 
= ^ + n log /3 + Er=i log - ELi iog(/3y*) 

= UEr=i(i-e-('^^'^'r 



1-e- 



The MLE of 0, say 0, is obtained by solving the nonlinear system C/„ (0) = 0. The solution of 
this nonlinear system of equation has not a closed form. For interval estimation and hypothesis 
tests on the model parameters, we require the information matrix. The 4x4 observed information 
matrix is 



In (O) = - 



laa lap ^a-y ^aO 
Ial3 1^/3 Ipj Ipe 
lay -^77 Iy6 



whose elements are given in Appendix. 

Applying the usual large sample approximation, MLE of Q i.e. can be treated as being 
approximately A^4(0, J„(0)^^), where (0) = -E[/„(0)]. Under conditions that are fulfilled 
for parameters in the interior of the parameter space but not on the boundary, the asymptotic 
distribution of v^(0-0) is iV4(0, J(e)~^), where J (0) = lim 

n-^oof^ Ini&) is the unit infor- 
mation matrix. This asymptotic behavior remains valid if J(0) is replaced by the average sample 
information matrix evaluated at 0, say n~^/„(0). The estimated asymptotic multivariate nor- 
mal A^4(0, /n(0) ^) distribution of can be used to construct approximate confidence intervals 
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for the parameters and for the hazard rate and survival functions. A 100(1 — 7) asymptotic 
confidence interval for each parameter is given by 



ACIr = {Qr - ZiVI"\er + ^xV I 

where 1^^ is the (r, r) diagonal element of /n(0) for r = 1, 2, 3, 4, and Z2 is the quantile 
1 — 7/2 of the standard normal distribution. 

11.1. EM- algorithm 

The MLEs of the parameters a, /3, 7 and 6 in previous section must be derived numerically. 
Newton- Raphson algorithm is one of the standard methods to determine the MLEs of the pa- 
rameters. To employ the algorithm, second derivatives of the log-likelihood are required for all 
iterations. The EM-algorithm is a very powerful tool in handling the incomplete data problem 
(Dempster et al., [l^; McLachlan and Krishnan, [29]). It is an iterative method by repeatedly 
replacing the missing data with estimated values and updating the parameters. It is especially 
useful if the complete data set is easy to analyze. 

Let the complete-data be Yi, • • • with observed values yi, • • • ,yn and the hypothetical 
random variable Zi, - ■ ■ ^Z^. The joint probability density function is such that the marginal 
density of Yi, • • • ,1^ is the likelihood of interest. Then, we define a hypothetical complete-data 
distribution for each (1^, Zj) i = 1, • • • , n, with a joint probability density function in the form 

<7(y,z;e) = /(2/|z)/(z) = ^|(J_-^) ^«7/3^?/^-'e-(^^)^(l - e-^f^v)'' Y'^-\ 

where 6 = (a, /3, 7, 6*), y > and z G N. 

Under the formulation, the E-step of an EM cycle requires the expectation of {Z\Y; @^^^) where 
©('■) = (^a^^\ j3^'^\j(^\0^'^^) is the current estimate of (in the rth iteration). 
The pdf of Z given Y, say g{z\y) is given by 

9{z\y) = , 

with the expectation 

E[Z\Y = y] =l + e{l-e-^^y^'')'^. 

The EM cycle is completed with the M-step by using the maximum likelihood estimation 
over 0, with the missing Z's replaced by their conditional expectations given above. 
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The log-likelihood for the complete-data is 

Inivir-- ,yn]Zi,--- ,z;G) oc J27=i ^i'^ogO - nlog{e^ - 1) + nloga + nlog7 + 717 log/3 

+(7 - 1) EILi logyi - Er=i(/3y.)^ + Er=i(^*« - 1) iog(i - e-^^^')' 

The components of the score function U*{Q) = ^)"^ are given by 

§ =S + Er=i^aog(i-e-(/^^^^r), 

f =^+niog/3+ ELi log^. - Er=i(/3^o^ logm) + Er=i(^.« - 1) (^^'^75?,;;-;'^"'' , 

~ e 1^1=1 ^* i-e-"- 

From a nonlinear system of equations U*{Q) = 0, we obtain the iterative procedure of the 
EM-algorithm as 



n + nlog/3W + Er=i logy. - Er=i log(/3W2/.)(;3W2 



n ri 

§(t+i) i_e-9(*+i) ^' 

where a*^*"''^\ /S'-*"'"^-* and ^(*+^) are found numerically. Hence, for i = 1, • • • ,re, we have that 

12. Sub-models of the EWP distribution 

The EWP distribution contains some sub-models for the special values of a, /3 and 7. Some 
of these distributions are discussed here in details. 

12.1. The CWP distribution 

The CWP distribution is a special case of EWP distribution for a = 1. Our approach here 
is complementary to that of Morals and Barreto-Souza 311] in introducing the Weibull-Poisson 
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(WP) distribution. The pdf, cdf and hazard rate function of the CWP distribution are given, 
respectively by 

e(l_e-(/3y)^) _ -, 

F{y) = - 



and 



h{y) 



- 1 



(.e _ ge{i-e-(Pyr') 

According to (|lip and (112p . the mean and variance of the CWP distribution are given, respec- 
tively by 



n=l j=0 



and 



where -E'(i^) is given by Eq. (j20p . One can obtain the Weibull distribution from the CWP 
distribution by taking to be close to zero. 

12.2. The GEP distribution 

GEP distribution is a special case of the EWP distribution for 7=1. This distribution is 
introduced and analyzed in details by Mahmoudi and jafari The pdf, cdf and hazard rate 
function of the GEP distribution are given, by 

and 

~ _ gSCl-e-zS!/)" ' 

respectively. The mean and variance of GEP distribution are 

/na — 1 \ 

-^'^^ - /3(e8-l) ^n=l Z^i=0l -^J (n-1)! (j + l)^ 

E^=i^V'(na + l)-|V'(l), 



(21 
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and 



/ TiQ — 

^^'(1) - ^.(^ E:Li ^ {i^'ina + 1) + (^(na + 1) - ^(1))^) - E\Y), 



where E(Y) is given in ()2ip . is the digamma function and is its derivative. 

12.3. The CEP distribution 

The CEP distribution is another special case of EWP distribution obtained by replacing 
a = 7 = 1, in EWP distribution. Cancho et al. [9] introduced this distribution as the name 
Poisson-exponential distribution. This distribution has an increasing failure rate and arises on 
a latent complementary risk problem base. The pdf, cdf and hazard rate function of the CEP 
distribution are given, respectively by 

fiy) = ^6-/^^6^(1—"^), 

e{i-e-Py) _ 1 

F{y) = ^- , 

e'^ — 1 

and 

The mean and variance of the CEP distribution are given by 

E(Y) - ^ y^n-l/ ..j ("/) 

) - /3(e»-l) Z^n=l 2^j=0\ ^) (n-iy.Jj+W 



(22) 



and 

= ^^P'il) - En=i ^ [i^'in + 1) + {^Pin + 1) - ^(1))^) - E'{Y), 

respectively, where E{Y) is given in Eq. (j22p 

13. Applications of the EWP distribution 

To show the superiority of the EWP distribution, we compare the results of fitting this 
distribution to some of theirs sub-models such as EW, GE and Weibull distributions, using 
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Scaled TTT transform 



Kaplan-Meier Curve 




Figure 3: TTT plots and Kaplan-Meier curves of strengths of 1.5 cm glass fibres and the failure times in hours 
data. 

two real data sets. The required numerical evaluations are implemented using the SAS and R 
softwares. The empirical scaled TTT transform (Aarset, and Kaplan-Meier curve can be 
used to identify the shape of the hazard function. 

The first data consists of 63 record of the strengths of 1.5 cm glass fibres, measured at 
the National Physical Laboratory, England. Barreto-Souza et al. Q] used this data to fit the 
beta-generalized exponential (BGE) distribution. The TTT plot and Kaplan-Meier curve for 
the first data in Fig. 3 shows an increasing hazard rate function and, therefore, indicates that 
appropriateness of the EWP distribution to fit this data. Table 1 lists the MLEs with the 
standard deviations of the parameters, the values of K-S (Kolmogorov-Smirnov) statistic with 
its respective p-value, -21og(L), AIC (Akaike Information Criterion), AD (Anderson-Darling 
statistic) and CM (Cramer-von Mises statistic) for the first data. These values show that the 
EWP distribution provide a better fit than the EW, GE and Weibull for fitting the first data. 

We apply the AD and CM statistics, in order to verify which distribution fits better to this 
data. The AD and CM test statistics are described in details in Chen and Balakrishnan 11]. 



In general, the smaller the values of AD and CM, the better the fit to the data. According to 
these statistics in Table 1, the EWP distribution fit the first data set better than the others. 

Using the likelihood ratio (LR) test, we test the null hypothesis HO: EW versus the alter- 
native hypothesis HI: EWP, or equivalently, HO: 6 = versus HI: 6^0. The value of the 
LR test statistic and the corresponding p- value are 3.4 and 0.0651, respectively. Therefore, the 
null hypothesis (EW model) is rejected in favor of the alternative hypothesis (EWP model) 
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Table 1: MLEs(stds.), K-S statistics, p-values, — 21og(L) and AIC for the strengths of 1.5 cm glass fibres. 



UlSt. 


1\ /TT TZn^/Ari-J^ \ 

MLrjS^stcls.j 




K-b 


p-value 


-21og(L) 


AiO 


AD 




EWP 


a — 0.578(0.3424), /3 
7 = 5.502(1.395), = 


= 0.647(0.0508) 
2.782(1.568) 


0.1154 


0.3731 


26.0 


34.0 


0.682 


0.195 


EW 


a = 0.671(0.2489),/? 
7 = 7.285(1.707) 


= 0.582(0.0292) 


0.1901 


0.0211 


29.4 


35.4 


1.086 


0.279 


GE 


a=31.349(9.519), /3= 


=2.612(0.2380) 


0.2291 


0.0027 


62.8 


66.8 


4.340 


0.881 


WeibuU 


^=0.614(0.0140), 7= 


:5. 781(0.5761) 


0.1589 


0.0830 


30.4 


34.4 


3.135 


0.297 



for a significance level > 0.0651. For test the null hypothesis HO: GE versus the alternative 
hypothesis HI: EWP, or equivalently, HO: (7,^) = (1,0) versus HI: (7,^) 7^ (1>0), the value 
of the LR test statistic is 36.8 (p-value = le-08), which includes that the null hypothesis (GE 
model) is rejected in favor of the alternative hypothesis (EWP model) for any significance level. 
We also test the null hypothesis HO: Weibull versus the alternative hypothesis HI: EWP, or 
equivalently, HO: (a,^) = (1,0) versus HI: (a,^) / (1,0). The value of the LR test statistic is 
4.4 (p-value = 0.1108), which includes that the null hypothesis (Weibull model) is rejected in 
favor of the alternative hypothesis (EWP model) for a significance level > 0.1108, and for any 
significance level < 0.1108, the null hypothesis is not rejected but the values of AD and CM 
in Table 1 show that the EWP distribution gives the better fit to the first data set than the 
Weibull distribution. 

As a second application, we consider the data show the stress-rupture life of kevlar 49/epoxy 
strands which were subjected to constant sustained pressure at the 90 stress level until all had 
failed. The failure times in hours are shown in Andrews and Herzberg i3] and Barlow et al. 
The TTT plot and Kaplan-Meier curve for this data in Fig. 3 shows bathtub-shaped hazard 
rate function and, therefore, indicates that appropriateness of the EWP distribution to fit this 
data. The MLEs with the standard deviations of the parameters, the values of K-S statistic, 
p- value, -21og(L), AIC, AD and CM are listed in Table 2. From these values, we note that the 
EWP model is better than the EW, GE and Weibull distributions in terms of fitting to this 
data. 

Plots of the estimated pdf, cdf and survival function of the EWP, EW, GE and Weibull 
models fitted to the data sets corresponding to Tables 1 and 2, respectively, are given in Fig. 
4. These plots suggest that the EWP distribution is superior to the EW, GE and Weibull 
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Tabic 2: MLEs(stds.), K-S statistics, p-values, — 21og(L) and AIC for the strengths of 1.5 cm glass fibres. 



Dist. 


TV ITT T~^ / J- J \ 

MLEs(stds.) 




K-S 


1 

p- value 


-21og(L) 


AIC 


AD 


CM 


EWP 


a = 0.859(0.3681), p 
7 = 0.8717(0.2409),^ 


~ 1.303(0.7398) 
= 1.266(1.2007) 


0.0725 


0.6640 


204.6 


212.6 


0.841 


0.218 


EW 


a = 0.793(0.2870),^ 
7 = 1.060(0.2399) 


= 0.821(0.2651) 


0.0844 


0.4677 


205.6 


211.6 


0.955 


0.247 


GE 


a=0.866(0.1098), /3= 


0.888(0.1201) 


0.0887 


0.4041 


205.6 


209.6 


1.021 


0.263 


Weibull 


^=1.010(0.1141), 7= 


0.926(0.0726) 


0.0907 


0.3777 


206.0 


210.0 


1.224 


0.2789 



distributions in fitting these two data sets. 
14. Conclusion 

A new four-parameter distribution called the EWP distribution is introduced. This dis- 
tribution is a generalization of the EW distribution and contains several lifetime sub-models 
such as: GEP, CWP, CEP, ERP and RP. A characteristic of the EWP distribution is that its 
failure rate function can be decreasing, increasing, bathtub-shaped and unimodal depending on 
its parameter values. Several properties of the new distribution such as its probability density 
function, its reliability and failure rate functions, quantiles and moments is obtained. The maxi- 
mum likelihood estimation procedure via a EM-algorithm is presented. Fitting the EWP model 
to two real data sets indicates the flexibility and capacity of the proposed distribution in data 
modeling. In view of the density and failure rate function shapes, it seems that the proposed 
model can be considered as a suitable candidate model in reliability analysis, biological systems, 
data modeling, and related fields. 
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Figure 4: Fitted pdf and cdf of the EWP, EW, GE and Weibull distributions for the data sets corresponding to 
Tables 1 and 2, respectively. 
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Appendix 

The elements of 4 x 4 observed information matrix (G) are given by 

laa = + ^ELi (i - e-(f^y^yyiog (1 - e-(f^y^rf , 

+ Er=i y^e-(^^^)^ (l - e-^^y^y) log (l - e-^^y^y) {Py^V , 
lae = ELi (1 - ^-^^''^1 " log (1 - ^-^^"''^1 . 

la, = Er=i ^^'"^^yfy^ + e e-^f^y^y (i - e-^^y^y) -1 log iPm) ipy,) ^ 

+ «^Er=i e'^^''^'' (l - e-(^y^yy log (l - e-(^y^y) log i/3yi) (/3y,)^ , 

//3/3 =-i^-7(7-i)Er=iy'(^yO^-' + (a-i)7(7-i)Er=iyr"^'"'" 



+ a9ji^ - 1) Er=i yh-^^'''>' (1 - e-^^^'^') iPyi) 

+ a(a - 1)^7' EILi e-'(^^^)' (l - e-(^^^)^) (^y^) 2(7-1)^ 



= - EILi - 7 ELi log (/^y^) + - 1) ELi --^^^^^^^^^ 

log(/3i/i)(/ 



l_e-(^ 

-t- la -Lj7z^i=iyi 1 „-f/3w,n v« '-n2^i=iyi a .-f/3«,i7\2 



+ «^7Er=i y«e-(^^*)' (1 - e-(f^y^y) log iPyi) {Py^) 

- a^7Er=i y^e-^^y^y (1 - e-(^2/i)-) a-i bg (/Sy^) 27-1 

+ a{a - 1)^7 E-=i y^e-2(/32/,)- (1 _ ^-^V) a-2 (^^.^ (^^.^ 27-1^ 
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7"„„ ne n 

~ {e-»-lf ~ ^' 



I,, =-^- ELi log {f^y^) iPy^r + («-!) Er=i 



, a-1 



+ «^Er=ie-(''^^)' (l-e-(/32/ir)"~ log(/3yO'(/3yO^ 
- «^Er=i e-^''^^)'' (l - e-if'y^yy log (/3yO' 
+ «(« - 1)^ Er=i (1 - e-(^^^)^)"-' log {/3y,f {^y,f^ . 
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